Anthropomorphic feature extraction algorithm for speech recognition in adverse environments
نویسندگان
چکیده
Speech recognition engines should remain reasonably accurate in adverse environments in order to find their ways from laboratories towards applications. However the human auditory system has been proven to be a versatile tool, which is capable of outperforming the known artificial algorithms in their target environments. Recent advances in psychoacoustics and auditory physiology pointed to the essentially non-linear behaviour of the auditory apparatus. On the basis of the interpretation of the biological information processing it is possible to construct a parametric “human-like” nonlinear algorithm, which exhibit properties similar to those of the live system. Besides the description of the anthropomorphic feature extraction algorithm in this paper we test its performance in accordance with the formulated requirements to the efficient and robust feature extraction and also provide a comparative benchmark of compact ASR system in combination with the proposed algorithm in adverse conditions.
منابع مشابه
A high-performance auditory feature for robust speech recognition
An auditory feature extraction algorithm for robust speech recognition in adverse acoustic environments is proposed. Based on the analysis of human auditory system, the feature extraction algorithm consists of several modules: FFT, outer-middle-ear transfer function, frequency conversion from linear to Bark scales, auditory filtering, nonlinearity, and discrete cosine transform. Three recogniti...
متن کاملAdvanced front-end for robust speech recognition in extremely adverse environments
In this paper, a unified approach to speech enhancement, feature extraction and feature normalization for speech recognition in adverse recording conditions is presented. The proposed frontend system consists of several different, independent, processing modules. Each of the algorithms contained in these modules has been independently applied to the problem of speech recognition in noise, signi...
متن کاملتشخیص لهجه های زبان فارسی از روی سیگنال گفتار با استفاده از روش های استخراج ویژگی کارآمد و ترکیب طبقه بندها
Speech recognition has achieved great improvements recently. However, robustness is still one of the big problems, e.g. performance of recognition fluctuates sharply depending on the speaker, especially when the speaker has strong accent and difference Accents dramatically decrease the accuracy of an ASR system. In this paper we apply three new methods of feature extraction including Spectral C...
متن کاملFrequency-domain auditory suppression modelling (FASM) - a WDFT-based anthropomorphic noise-robust feature extraction algorithm for speech recognition
This paper presents a physiologically inspired feature extraction algorithm for employment within the speech recognition engines, which are supposed to remain effective in noisy environments. Essentially, the algorithm simulates a key property of the “active cochlea” models – a signal dependent variable gain over the frequency range. In order to drastically reduce computational complexity of th...
متن کاملImproving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms
One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004